AITopics | training recurrent network

Training recurrent networks to generate hypotheses about how the brain solves hard navigation problems

Neural Information Processing Systems, Nov-21-2025, 15:22:47 GMT

Self-localization during navigation with noisy sensors in an ambiguous world is computationally challenging, yet animals and humans excel at it. In robotics, Simultaneous Localization and Mapping (SLAM) algorithms solve this problem through joint sequential probabilistic inference of their own coordinates and those of external spatial landmarks. We generate the first neural solution to the SLAM problem by training recurrent LSTM networks to perform a set of hard 2D navigation tasks that require generalization to completely novel trajectories and environments. Our goal is to make sense of how the diverse phenomenology in the brain's spatial navigation circuits is related to their function. We show that the hidden unit representations exhibit several key properties of hippocampal place cells, including stable tuning curves that remap between environments. Our result is also a proof of concept for end-to-end learning of a SLAM algorithm using recurrent networks, and a demonstration of why this approach may have some advantages for robotic SLAM.
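To make the difficulty of self-localization with noisy sensors concrete, the following is a minimal sketch, not taken from the paper, of dead reckoning: integrating noisy velocity readings into a position estimate. The function name `dead_reckon`, the step count, and the noise level are all illustrative assumptions. Because each step adds independent noise, the position error behaves like a random walk and tends to grow over time unless it is corrected by landmark or boundary information, which is the part SLAM-style inference supplies.

```python
import numpy as np

def dead_reckon(velocities, dt=1.0):
    """Integrate 2D velocity readings into positions, starting at the origin.

    Hypothetical helper for illustration only.
    """
    return np.cumsum(velocities * dt, axis=0)

rng = np.random.default_rng(0)
n_steps = 500

# Assumed ground-truth motion: small random velocity at each step.
true_vel = 0.1 * rng.standard_normal((n_steps, 2))
# Assumed sensor model: additive Gaussian noise on every velocity reading.
noisy_vel = true_vel + 0.02 * rng.standard_normal((n_steps, 2))

true_pos = dead_reckon(true_vel)
est_pos = dead_reckon(noisy_vel)

# Distance between estimated and true position at every time step.
err = np.linalg.norm(est_pos - true_pos, axis=1)
print("error after first step:", err[0])
print("error after last step: ", err[-1])
```

The error at any step is simply the running sum of the per-step noise, so no amount of careful integration removes it; only extra observations can.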

artificial intelligence, deep learning, machine learning

Professor Forcing: A New Algorithm for Training Recurrent Networks

Neural Information Processing Systems, Nov-21-2025, 14:21:18 GMT

The Teacher Forcing algorithm trains recurrent networks by supplying observed sequence values as inputs during training and using the network's own one-step-ahead predictions to do multi-step sampling. We introduce the Professor Forcing algorithm, which uses adversarial domain adaptation to encourage the dynamics of the recurrent network to be the same when training the network and when sampling from the network over multiple time steps. We apply Professor Forcing to language modeling, vocal synthesis on raw waveforms, handwriting generation, and image generation. Empirically we find that Professor Forcing acts as a regularizer, improving test likelihood on character-level Penn Treebank and sequential MNIST. We also find that the model qualitatively improves samples, especially when sampling for a large number of time steps. This is supported by human evaluation of sample quality. Trade-offs between Professor Forcing and Scheduled Sampling are discussed. We produce t-SNE visualizations showing that Professor Forcing successfully makes the dynamics of the network during training and sampling more similar.
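The two regimes contrasted in the abstract can be sketched as follows. This is a minimal illustration under stated assumptions, not the authors' code: the tiny tanh cell `one_step`, its random untrained weights, and the sine-wave data are all hypothetical, and generation here is deterministic rather than sampled from a predictive distribution. The adversarial part of Professor Forcing (a discriminator that compares the hidden-state dynamics of the two rollouts) is not implemented.

```python
import numpy as np

def one_step(params, h, x):
    """Hypothetical one-step RNN cell: returns (new hidden state, next-value prediction)."""
    W_h, W_x, w_out = params
    h_new = np.tanh(W_h @ h + W_x * x)
    y_pred = float(w_out @ h_new)
    return h_new, y_pred

def teacher_forced_predictions(params, seq):
    """Training-mode rollout: the input at every step is the observed value."""
    h = np.zeros(params[0].shape[0])
    preds = []
    for x in seq[:-1]:
        h, y = one_step(params, h, x)
        preds.append(y)
    return np.array(preds)

def free_running_outputs(params, x0, n_steps):
    """Generation-mode rollout: each prediction is fed back as the next input."""
    h = np.zeros(params[0].shape[0])
    x = x0
    outs = []
    for _ in range(n_steps):
        h, x = one_step(params, h, x)
        outs.append(x)
    return np.array(outs)

rng = np.random.default_rng(0)
hidden = 8
# Random, untrained weights; a real model would learn these.
params = (0.5 * rng.standard_normal((hidden, hidden)),
          rng.standard_normal(hidden),
          rng.standard_normal(hidden))

seq = np.sin(0.3 * np.arange(1, 21))  # assumed toy sequence of 20 values

tf = teacher_forced_predictions(params, seq)
fr = free_running_outputs(params, seq[0], len(seq) - 1)

# One-step-ahead squared error, the usual teacher-forcing training signal.
loss = float(np.mean((tf - seq[1:]) ** 2))
print("teacher-forced loss:", loss)
```

Both rollouts agree on the first step, since they start from the same hidden state and the same input. After that the free-running rollout conditions on its own outputs, so its hidden states can drift away from those seen during training; that mismatch is what Professor Forcing targets.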

new algorithm, professor forcing, training recurrent network

Professor Forcing: A New Algorithm for Training Recurrent Networks

Neural Information Processing Systems, Feb-11-2025, 18:43:37 GMT

The Teacher Forcing algorithm trains recurrent networks by supplying observed sequence values as inputs during training and using the network's own one-step-ahead predictions to do multi-step sampling. We introduce the Professor Forcing algorithm, which uses adversarial domain adaptation to encourage the dynamics of the recurrent network to be the same when training the network and when sampling from the network over multiple time steps. We apply Professor Forcing to language modeling, vocal synthesis on raw waveforms, handwriting generation, and image generation. Empirically we find that Professor Forcing acts as a regularizer, improving test likelihood on character-level Penn Treebank and sequential MNIST. We also find that the model qualitatively improves samples, especially when sampling for a large number of time steps.

new algorithm, professor forcing, training recurrent network

Reviews: Professor Forcing: A New Algorithm for Training Recurrent Networks

Neural Information Processing Systems, Jan-20-2025, 07:03:32 GMT

The idea in this paper is interesting and well motivated. When training an RNN for generation, using the likelihood of the observed data is not the proper criterion. Even though the problem and approach are interesting, I find the description of the training objective (Sec. Precise remarks follow: - The random variable y is present in two expectations in Eq. (1). Given an RNN with a given sequence of inputs, all the y can be computed.

new algorithm, professor forcing, training recurrent network

Reviews: Training recurrent networks to generate hypotheses about how the brain solves hard navigation problems

Neural Information Processing Systems, Oct-7-2024, 22:28:03 GMT

An LSTM recurrent neural network is trained to perform simultaneous localization and mapping (SLAM) tasks, given noisy odometry data and occasional input from contacts with walls. The paper is well written and I find it interesting to study recurrent neural network solutions of SLAM. But the idea of using LSTM-RNNs for SLAM problems is not new (as the authors mention in the discussion, line 256) and the results do not really surprise me. Given the number of training trials, the network has presumably seen sufficiently many trajectories (and variations of the environment) to perform well on most test examples (and if the difference between training and testing error is too large, follow the standard recipes and tune the number of hidden neurons and the number of training trials). A more interesting result would be if the LSTM-RNN generalized to out-of-distribution samples, i.e. if its performance were comparable to that of a PF SLAM in totally different environments than the ones seen during training.

brain solve hard navigation problem, generate hypothesis, training recurrent network

Professor Forcing: A New Algorithm for Training Recurrent Networks

Lamb, Alex M., Goyal, Anirudh, Zhang, Ying, Zhang, Saizheng, Courville, Aaron C., Bengio, Yoshua

Neural Information Processing Systems, Feb-14-2020, 16:13:37 GMT

The Teacher Forcing algorithm trains recurrent networks by supplying observed sequence values as inputs during training and using the network's own one-step-ahead predictions to do multi-step sampling. We introduce the Professor Forcing algorithm, which uses adversarial domain adaptation to encourage the dynamics of the recurrent network to be the same when training the network and when sampling from the network over multiple time steps. We apply Professor Forcing to language modeling, vocal synthesis on raw waveforms, handwriting generation, and image generation. Empirically we find that Professor Forcing acts as a regularizer, improving test likelihood on character-level Penn Treebank and sequential MNIST. We also find that the model qualitatively improves samples, especially when sampling for a large number of time steps.

new algorithm, professor forcing, training recurrent network

Training recurrent networks to generate hypotheses about how the brain solves hard navigation problems

Kanitscheider, Ingmar, Fiete, Ila

Neural Information Processing Systems, Feb-14-2020, 15:41:49 GMT

Self-localization during navigation with noisy sensors in an ambiguous world is computationally challenging, yet animals and humans excel at it. In robotics, Simultaneous Localization and Mapping (SLAM) algorithms solve this problem through joint sequential probabilistic inference of their own coordinates and those of external spatial landmarks. We generate the first neural solution to the SLAM problem by training recurrent LSTM networks to perform a set of hard 2D navigation tasks that require generalization to completely novel trajectories and environments. Our goal is to make sense of how the diverse phenomenology in the brain's spatial navigation circuits is related to their function. We show that the hidden unit representations exhibit several key properties of hippocampal place cells, including stable tuning curves that remap between environments.

brain solve hard navigation problem, recurrent network, training recurrent network
